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A survey is given of some of the methods used to prove various clas¬ 
sical inequalities. 


§1. Introduction 

The object of this paper is to interest the reader in the topic of in¬ 
equalities. It is a subject that requires only the most basic mathematical 
knowledge and skills, although more sophisticated ideas can play a part 
as well. While it is probably true to say that any mathematician can read 
a paper on inequalities, it is hoped that this survey will persuade others, 
who are too modest to call themselves mathematicians, that they too can 
read such papers, and further that they can add to our knowledge of the 
subject. 

The results to be discussed are all classical, and can be found in most 
of the standard works listed in the references;[1]—[5], [7],[8],[10],[11]. They 
will be used to illustrate methods of proof, to show how simple inequalities 
can be generalized, and the ability of inequalities to assume almost inpen- 
etrable disguises; that is, two inequalities that are in fact identical may 
look very different; further to this see [3], p.139, and p.155. Most of the 
time we will use elementary algebra, and sometimes elementary calculus, 
yet the results discussed are of extreme importance, and are of great use 
in applications; see [8],[14] and section 4 below. 


* This paper is based on talks given at the University of Malaya, Kuala Lumpur, and Chula- 
longkorn University, Bangkok, in the spring of 1993. The author wishes to thank the mathe¬ 
matics departments of those universities for their invitations. 
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§2. Bernouilli’s Inequality 

Consider the following statement: 


o>0, /? > 0, n > 1 =► > na — (n — l)/3; (1) 

and the two sides are equal only if either n = 1 or a = /?. 

Here, as throughout, we are making statements about real numbers, 
and in addition we require n to be a positive integer. So (1) is short for: 
for all positive a, /?, and for all positive integers n the inequality in (1) 
holds, further it is strict except in the two situations mentioned. 

How can this be proved? One standard technique is to re-write and, or 
re-interpret an inequality such as (1) and see if this gives us any idea of how 
to proceed. We will do this in two ways; the first following Nanjundiah, 
[12], and secondly in a way that is more standard. 

The advantages of several proofs of the same statement are various; 
it may give more insight into just what is being said, and perhaps when 
trying to generalize a statement only one of the proofs will suggest how 
to do this. 

Nanjundiah’s approach is to re-interpret (1) as a statement about two 
sequences. If 


a o: 2 a 3 a n 

L is the sequence : Za L 2 = —, L 3 = L n = 

and, 

R is the sequence : R\ = a, R 2 = 2a — /3 , R 3 = 3a — 2/3 ,..., 

R n = na — (n — 1)/?,... 

then inequality (1) just says that the sequence L dominates the sequence 
R ; that is 

Ln ^ Rni n = 1 , 2 ,... (2) 

or even more shortly. 

L>R. (3) 

Now since L\ = R\ both sequences start at the same place, so (2), or (3), 
will certainly hold if each move along the sequence L is at least as big as 
the corresponding move along the sequence R ; that is if 


Ln L n —1 ^ Rn Lt n —1 n — 2,3,... 
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or equivalently if 


(L n — L n _i) — (i2„ — i? n _i) > 0 n —2,3,... 


It is easily seen see that 


(4) 


L n L n -\ — p n ( a P)i 
R n - R n -i =(a - ft). 

So, substituting in the left hand side of (4), 


(L n ~ in-l) - (Rn ~ Rn- 1) = - l) (a - ft ). 

Since either a > ft, when the last two factors are both positive, or a < ft, 
when the last two factors are both negative, or a = ft when the last both 
factors are zero, this leads to (4). 

So we have proved (1), together with the cases of equality. 

Another method is to re-write inequality (1) to see that it is nothing 
else but a disguised form of Bernouilli’s Inequality , a result that goes back 
to the seventeenth century. To see this let us rearrange and generally play 
around with inequality (1) to see what we get. At each stage we get a 
different looking, but completely equivalent inequality. 

(i) Division by ft > 0 is possible and will not change the direction of the 
inequality. Hence: 

>na-(n-l)ft ^ (j) >n(j)-(n-l). 

This shows that our inequality does not depend on two variables at 
and ft but only on their ratio a/ft. 

(ii) Let us recognise this by putting x = ot/ft to get: 

x n > nx — (n — 1) 
x n > n(x — 1) + 1; 

The conditions on a, ft translate into x > 0, and the conditions for 
equality into n = 1, or x = 1. 


a 


0 


— >not-(n-l)ft 
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(iii) Now why not see what happens if we simplify the right hand side of 
the last expression by putting y = x — 1. This results in the following 
statement that is equivalent to that in (1): 

y>- 1 => (l + y)”>l + ny, (B) 

with equality only if either n = 1 or y = 0. 

This is the well-known Bernouilli Inequality; and there are many ways 
to prove it as can be in the references [1], [3] or [10]. 

One way of looking for a proof of an inequality such as (B) is to 
experiment with it by checking special cases. Let us do that, seeing what 
happens when n = 1,2,3,.... 

First note that if n = 1 then both sides of (B) are the same, while if 

n = 2 : left hand side of (B) = (1 + y) 2 

= 1 + 2 y + y 2 

> 1 + 2y = the right hand side of (B). 

Note that so far we have not needed y > — 1; also, in the case n = 2 
equality only occurs if y = 0, as stated. If now, 

n = 3 : left hand side of (B) = (1 + y) 3 

= 1 + 3y + y 2 (y + 3) 

> 1 + 3y = the right hand side of (B). 

provided y > —3, and again there is equality only if y = 0. 

This suggests we apply the Binomial Theorem in the general case; if 

n > 2 : left hand side of (B) = (1 + y) n 

1 > i 2 / 1) . n-2\ 

= 1 + ny + y { ---1-+ y ) 

> 1 + ny = the right hand side of (B), 

provided the last bracket is non-negative. However it is not easy to see 
when this is so. We can only state with certainty that this bracket is 
non-negative if y >0, which is not quite the statement we would like. 

Since the Binomial Theorem is proved by mathematical induction, 
this “almost proof” is also an example of proof by induction. If we are 
a little more careful we can use induction to get the right answer. So let 
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us continue our experiments. Consider the case n = 4, and assume as we 
may that y ^ 0; if 

n = 4 : left hand side of (B) = (1 + y) 4 

= (l + y) 3 (l + y) 

> (1 + 3y)(l + y), by the n = 3 case 
just proved, and provided 1 + y > 0 ,» 
= 1 + 4y + 3y 2 > 1 + 4y 

= right hand side of (B); 


which gives proof under the right conditions. 

A similar argument can be given if n = 5 and y > — l,y ^ 0, but 
appealing to the case n = 4 that has just been proved. 

This can now be formalised into a proof by induction of (B). 

A question remains: is the condition y > — 1 as result of our method 
of proof or is it essential? A simple example shows that it is necessary: 

-32 = (—2) 5 = (1 + (—3)) 5 < 1 + 5(—3) = -14. 


It would appear that none of the above approaches to (B) suggests how to 
prove the obvious generalisation in which the positive integer n is replaced 
by an arbitrary real number r. However note that in such an extension 
the condition y > — 1 would be automatically required since the general 
rth power is not defined for zero or negative numbers. We clearly get 
equality if r = 0 or r = 1, the last case being covered by (B) itself; but 
what is true otherwise, r = 1/2 or r = —1/3 say? Since rth powers are 
more sophisticated than ordinary algebra it is only natural to expect to 
use calculus here. 

If we remember that the Binomial Theorem is a special case of Tay¬ 
lor’s Theorem it is natural to try to use this result to generalize (B). We 
appeal to Taylors’s Theorem with remainder, omitting the trivial cases of 
r = 0,1, and y = 0; then for some z between 0 and y, 

(l + i,) r = l + r i ,+ r(r 2 ~ 1) (l+z)'- :i . (T) 


So, on simple inspection of this we get 


(1 + 



1 + ry, 
1 + ry , 
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if r(r — 1) > 0; 
if r(r — 1) < 0; 



Since the condition r(r — 1) is equivalent to r > 1 or r < 0, and the 

condition r{r — 1) < 0 is equivalent to 0 < r < 1 we get, on collecting the 

trivial cases the following complete generalisation of (B): 

{ > 1 + ry if r > 1 or r < 0, 

<l + ry if 0 < r < 1, ( B g ) 

= 1 + ry if r = 1 or r = 0. 

However we do not need to use Taylor’s Theorem; more elementary cal¬ 
culus will suffice. Consider the following function 

/(y) = (i + y) r -(i + n/), v>-i» 

and assume that r ^ 0,1. 

Easy calculations will show that /(0) = /'(0) = 0, and that further 
y = 0 is the only zero of /'; finally /"(0) = r(r — 1). It follows that y = 0 
is the maximum of / if r(r — 1) > 0, and the minimum of / if r(r — 1) < 0; 
so in the first case / is a negative function except at y = 0, while in the 
second case / is a positive function except at y = 0. This gives another 
proof of (Bg). 

Of course, by reversing the steps that led to (B) from (1) we can get, 
from (B g ) the following generalization of (1). 

o;>0, /3>0, =* > ra - (r - 1)0; (l g ) 

if r > 1 or r < 0 ;*/ 0 < r < 1 the oppposite inequality holds; there is 
equality only if r = 1, r = 0, or if a = /?. 


§3. The Geometric and Arithmetic Means 


If Oi,..., a n are n positive numbers put a = (dj,..., a n ) and write 


-'‘tn(ll) —5 • • • i ®n) — 


a \ + a 2 + - * • + a n 

n 


G n(^) —— \/CL\CL2 * ’ * ^ n ! 

These quantites are the arithmetic and geometric means of a, respectively. 
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Is it true that these two means are comparable ? That is, is it true 
that for all positve a, and all positive integers n that we either always 
have that 


G n (a) = v^axa 2 


a n < 


a l + a 2 + ’ * • + 


n 


— -A n (a), 


(G-A) 


or always have the opposite inequality? 

A few numerical experiments will suggest that if anything is true then 
it is (G-A), not the opposite, that will hold. Let us see if we can prove 
this by first looking at some special values of n. In the case of n = 2, and 
writing ai = x, a 2 = y, (G-A) is 

x + y 


s/xy < 

As before, let us try rewriting and see if anything is suggested: 

x + y 


(5) 


s/xy < 


2 y/xy < x + y 

0 < x - 2 y/xy + y 
0 < (Vx-y/y) 2 ; 


but this last statement is obvious. Further this argument shows that (5) 
is strict unless x = y. 

However if we now try to get a proof of the case n = 3 in a similar 
way we run into difficulties so let us follow Cauchy and proceed to n = 4; 
there is some merit in this as fourth powers are related to squares in a 
nice way. Put ai = a,a^ = 6 , <23 = c, <14 = d when (G-A) is 

a+b+c+d 

- • ( 6 ) 


\/abed < 


Trying to use what we know rewrite the right hand side of ( 6 ) as follows 
a+b+c+d 


a + b c + d 
——— + —— 


\/ab + \fcd 

>---,by (5), by the case n = 2; 


>^/(\/a 6 )(\/cd), again by (5), the 


case n = 2 ; 


—y/abcd\ 
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which is (6). To get equality we need, from the first use of the n = 2 case 
that a = b and c = d, and from the second use we need that ab = cd; that 
is we need to have a = b = c = d. 

Now we can go back to the case n = 3: 


a+b+c a+6+c+ 


a b -f" c 


> 



a b c 


1/4 

, by (6), the case n = 4. 


This, by simple algebra, is just 



a + b + c 
3 


with equality only if a = b = c. 

This procedure suggests that next we consider the case n = 8, and 
then come back to the cases n = 5, 6, 7, then move to the case n = 16, 
and come back to the cases n — 9,..., 15. It can be checked that this will 
in fact work, the method is sometimes known as backward induction. Of 
course we do not have to prove each case missed separately; for instance 
we can handle an arbitrary k, 9 < k < 15 by writing 


a i +••• + «* 

k 

. / a i + , " + a fc\ . . / a i + ' -- + a fc\ 

a l +- ^ a k + ( l ) +-b ( £ ) 


(7) 


16 


there being 16 — k terms equal to ( a\ H- \-a,k)/k in the numerator on the 

right, giving a sum of 16 terms to which the n = 16 case can be applied. 

So (G-A) holds with equality only when cq = • • • = a n . There are 
many proof of this important result; 52 are given in [3], most of them 
extremely elementary. 

Now the question of generalizations arises. If we rewrite (G-A) as 


A n (a) — G„(a) >0 or 


A n(a) > 1 

Gn(fl) 


(Si 
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then since equality in the inequalities (8) can occur only in the trivial case 
of ai =■■■ — a n we could ask if bigger right hand sides are possible if 
not all of the a;, 1 < i < n are equal? To answer this question we follow 
Nanjundiah, [12] and use (1). 

Let us agree to the notation for 1 < k < n, 

A k (a) = — + k + — G k (a) = (a 1 ---a k ) 1/k . 


If 

a = A n (a ), P = A n _i(a) easy calculations give that na— (n—1 )P = a n ; 


a 


while if 

a' = G n (a), ft' — G n _i(a), easy calculations give that — = a n . 

r' 

This enables us to use (1) twice as follows: 


>na-(n- 1 )P = a n = > na' - (n - l)p'. 

Prom this we can deduce the two inequalities, 

TX / ^ 

and na - (n - 1 )p > na' - (n - 1)/?'.; 

which on rewriting give 

n —1 


( 5 )> 


I — 


and n(a — a') > (n — 1 ){P — P'). 


If the values given to a,a',P,P' are substituted into these last in terms 
the following inequalities, called the inequalities of Popviciu and Rado, 
respectively, are obtained. 


/ A n (g) \ n / >ln-i(a) 

\G n (a)J \G n _i(a 


n — 1 


and 


( 9 ) 


n(j4 n (®) G n — i(o)). 
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Inequalities (9) improve inequalities (8) since unless cq = • • • = a n _i the 
right hand sides are respectively bigger than zero, bigger than 1. In the 
deduction of (9) we used (1) and so there will be equality in (9) only if there 
is equality in the uses of (1); that is only if a = f3 for Popoviciu’s inequality 
and a' = /3' for Rado’s inequality; in other words, unless a n = A n _i(a), 
and a n = G n _i(a), repectively. 

Incidentally not only do (9) improve (G-A), they also give a proof of 
this inequality. Consider the case of the Rado inequality, and re-apply it 
to get 

™(^n(a) - G n (a)) > (n - l)(A n _i(a) - G„-i(a)) 

> (n-2)(A n _ 2 (a)-G n _ 2 (a)) (10) 

> • • • > 2(A 2 (a) — G 2 (a)) > (Aj(a) — Gj(a)) = 0, 

giving (G-A) in the first form in (8). If we stop one stage earlier in (10) 
we get 

A n (a) - G„(a) > ~(A 2 (a ) - G 2 (a )) = ■ 

n n 

However, the order of the terms in a do not matter, so by re-ordering if 
necessary, we get the following improvement of (G-A) 

1 2 
^n(a) - G n (a) > (-) ^ max^(^/a7 — ^/aj) . 


Now let us turn to a generalization of (G-A) suggested by (7); the right 
hand side of (7) can be rewritten as 


i i , fia . \°i + ••• + <** 

a i + • • • + a* + (16 — k) - - - 

16 ’ 

where now the numerator has k + 1 terms but one of the terms is given 
a weight of 16 — k, compared to a weight of 1 for all the other k terms. 
So now we introduce n positive numbers = (uq,... write W n = 

w\ + • • • + Wk, 1 < k < n, and define 


A„(a; w) =A n (ai,...,a n ;uq,...,u> n ) = 


aiwi + a 2 w 2 -1- a n w n 


G n (a;w n ) =G n (ai,... ,a n ]W\, , w n ) = (a^ap •••a" n ) 1/W ”; 
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the arithmetic and geometric means of a with weights w, respectively. The 
natural generalisation of (G-A) is 


O n (a; w) = (a™ 1 


u, n \i/Wn < QiWi + --- + a n u> n 
’ - W n 




(G-A,) 

This can be proved, even improved, by repeating the above argument that 
used (1), but now use (l s ) with 


a = A n (a; w),/3 = A n _i(a; w),a' = G n (a-,w),/3' = G n (a; w), 
and r = W n /w n , when r — 1 = W n -\/w n . 


This leads to the following generalization of (9), 

/ A n (q;w) \ Wn > / A n -i(a; u>) \ Wn ~ 1 
\G n (a;w)J ~\G n -i(a; w)) 
and 

W n (A n (a;w ) - G n (a] w)) >W n - 1 (A„- 1 (a;w) - G n -i(a]w)). 


§4. Applications of the Inequality between the Arithmetic 
and Geometric Means 

(a) The name mean is justified since it is very easy to prove that 

min a < A n (a]w ) < max a min a < G n (a\w ) < max a; (11) 

further these inequalities are strict unless ai = • • • a n . 

(b) Let us agree to a simple extension of our notation, writing a = 
(ai,a 2 ,..., a n ...), and w = (ti>i,u) 2 ,..., w n ,...) for two sequences of pos¬ 
itive numbers; then we will get two sequences of means, 

A = (Ai (a; w), A 2 (a; w),..., A„(a; w ),.. .) 

G = (Gi (a; w), G 2 (a; w),..., G n (a ; w ),...); 

when of course (G-A) just says that the sequence A dominates the sequence 
G, or G < A 

A question that can be asked is whether properties of a sequence are 
inherited by the sequence of its means? One classical theorem of Cauchy 
says that if a sequence converges then so does the sequence of its arithmetic 


58 





means, and further the limit is the same; see [6] p.90. A modification of the 
proof of this result will easily show that the same is true for the geometric 
means; alternatively we could use the simple identity: 


G n (a; w) - exp (a„ (log a; w) V 


( 12 ) 


where we have introduced the useful notation f(a ) = (/(ai), f(a 2 ),..., 

/(an),...); 

Then if limn-^ a n = a, we have from (12), using Cauchy’s result just 
quoted that 

lim G n (a; tw) = exp( lim (A n (loga; uA) ) = exp log a — a. 

ti — kx ) yn—►oo v J 

Another property that is inherited is monotonicity; if a is increasing then 
so is A, and G; further if the sequence is strictly increasing so are the 
mean sequences. First note the useful identities: 


, , s. W n - 1 v , w n 

A„(q;i£) = — A n -i(a; w) + —a„ 

= A 2 (A n _i(a;t£),a n ;TF n _i,u; n ); 
G n (a;w) =G n - 1 (a ] w) w ’'-''w» a ”r>/w n 

= G 2 (Gn-i(aiw),a n -,W n -i,w n ). 


(13) 


Suppose now that the sequence a is strictly increasing; by (11), A n _i(a; w) < 
a n _x and G n _i(a; w) < a n _i; so by (11) and (13), A n _x(a; u£) < A n (a;w ) 
and G n -i(a]w ) < G n (a;w .), that is, the sequences of the means are also 
strictly increasing. 

(c) If now e n = (1 + l/n) n ,n = 1,... an essential step in showing that 
limn—oo e n = e is to show that the sequence e n ,n = 1,... is strictly 
increasing. We can use (G-A) to do this, using an argument of Melzak; 
[9]. Let 

a,\ = 1, a* = 1 H- -,k = 2 ,...,n; 


n — 1 


then 


n — 1 

G n (a) = ^1 + ——< A n (a ) = 


1 + (n- l)(i + — y) j 

--- = ! + -; 


n 


n 
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which is just e n _i < e n . 
(d) Now we prove that 


n 


(14) 


lim 
n—► oo 



= e. 


To do this it suffices to note that if 


k k 

Q i = 1 » ctfc = — l) fe_1 = efc-1 ’ ^ = 2, - - -, n, 

then 

n . . 

e « = „ /—r = G n (a). 

Vn! 

Since, as we have just remarked the sequence e n , n = 1,... increases strict¬ 
ly to the limit e, so also, by the above discussion, must the sequence 
e n , n = 1,... increase strictly to the same limit. 


§5. The Power Means 

Another mean of great antiquity is the harmonic mean; 

XJ ( \ _ W n 

Bl4I) 'K«i) + - + K/a n )’ 
or, with the above notation, 

H n (a^w) = ^4 n (a _1 ;u>)^ . (15) 

Then a simple application of (G-A fl ) gives 

^A n (a _1 ;tc)^ < ^<2„(a -1 ; w)^j = G n (a;w), 

or 

H n (a;w) < G n (a; w). (16) 

The relation (15) is analogous to (12) in that in (12) we use the inverse 
functions exponential and logarithm, while in (15) we use the reciprocal 
function twice, it being its own inverse. This suggests a generalization in 
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which we use the rth power function and its inverse the (l/r)th power 
function. This would give the rth power mean, —oo < r < oo, r ^ 0; 


MW= A,(s r ;s) 


1/r 


a\w\ H-(- a r n w n 

W n 


l/r 


Clearly then r = l,r = — 1 are just the arithmetic and harmonic means 
respectively. While by (12) the geometric mean has a similar form it does 
not seem to fit into this scheme. However, we will show that 

lim M[ r l(a; w ) = G n (a; w), (17) 

r—►O 

so that we could reasonably define 

= G n (a- w). 

To prove (17) first note that we can without loss of generality assume that 
a,i > 1,1 < i < n for if not consider A a, where A is large enough to ensure 
that A a,i > 1,1 < i < n, and note that Mj^(Aa;u>) = A Mn\a;,w). With 
this in mind consider 

r 

As r —^ 0 both the numerator and the denominator of the right hand side 
tend to zero so we can apply L’Hopital’s Rule to get: 


lim log mH(«; w) = lim ±1 ■■ + ”»<■^ 

_ „ 6 n V-,-/ r _ Q Wia r + . . . + Wna r 

wi log ai H-(- w n log a n 


)—>o 


Wn 

=A n (log a; w) = log G n (a; w), by (12) 

A similar line of reasoning will show that 

lim Ml^(a]W_) — max a lim = min a; 

r—KX> i—► — oo 

so it is reasonable to define Mii'^ajtu) and Mn °°^(a; u>) by these limit 
values. 

It is a simple exercise to extend (11) as: 
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( 18 ) 


if —oo < r < oo then 

(a; w) < (a; to) < M^(a; to), 

with equality only if £tj = • • • = a„; further (G-A ff ) and (16) give: 

Mj“^(a; to) < M^(a;«i) < Mj^(a;to), 

with equality only if ai = • • • = a n . 

In fact these last two sets of inequalities are just particular cases of a 
another generalization of (G-A); 
if — oo < r < s < oo then 

M [ n r \a- to) < M [ n s] (a ; to), (r; s ) 

with equality only if a\ = • • • = a n . 

There are many proofs of this fundamental result, see [1], [3], [4], [10], 

[11]. We show now that rewriting will reduce (r;s) to a few special cases. 

(I) The cases when one at least of r,s is not finite are already 
contained in (18). 

(II) If one of r or s is zero, and the other is finite, the result is 
equivalent to (G-A ff ). 

(i) (0; s)); put b = a s when, since s > 0, 

(0;s) G n (b l /';w) < l A n (b; w)j 

<=> G n (b-,w) < A n (b-,w). 

(ii) (r; 0); substitute b = a r when, since r < 0, 

/ \ 1 / r 

(r;0) <«=* l A „(&;to)J <G n (b 1 ' r -w) 

> > ' Gn(ll )! Ml) — A n (b; to). 

(III) The cases —oo <r<0<s<oo follow from (II) and 

(G-A ff ). 

(IV) The cases —oo < r < s < 0 follow from the cases 0 < r < 
s < oo. Suppose that r,s are negative and put p = — r, a = — s and 
b = 1/a, then 0 < a < o < oo and 

(wW(fcie)) < (nKtsl) 

M [ n a] (b] to) < M[ p] (b; to). 
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(V) The cases 0 < r < s < oo follow from the cases (1; #). If 
0 < r < s < oo put b = a r ,t = s/r , when t > 1 and, 

(r;s) <*=>• A n (b]w) < M$(b\w) <*=>• (1;<)« ( 19 ) 

This long but elementary argument reduces the consideration of (r;s) to 
(G-A ff ),(II), already discussed, and the single case (l;t),(V), (19). To 
complete the discussion we turn to yet another generalization. 


§6. Convex Functions 

Let us reconsider the proof of (B s ); the use of Taylor’s Theorem there 
allows us to prove much more. Suppose that (j> is any function that is twice 
differentiable in some open interval containing 1; if then 1 + y is in that 
interval Taylor’s Theorem says that for some z between 0 and y, 

<t>(\ + y) = <£( 1 ) + y<t>'{ 1 ) + 1 + z), ( T g ) 

an equation that is a direct generalization of (T). So now if 4> has a second 
derivative that is non-negative we can deduce the inequality 

^(l + y)>^(l) + ^ , (l); (20) 

and (B fl ) is just (20) in the case <j>(x) = x r ,r < 0,r > 1; further if the 
second derivative of (f> is positive the only case of equality in (20) is y = 0. 
If the second derivative is non-positive then the opposite inequality to (20) 
holds, with equality again only when y = 0 if this derivative is negative, 
By repeating the discussion of the first paragraph we get the following 
generalization of (1 9 ), for <j> with <f>" > 0; 

w(<*/0) > a4 i'(i) + /?(#i) - ^-(1)). 

The generality of (20) can be seen in that it contains 

e 1+y > 1 + y, y 7 ^ 0 

by taking <f>(x ) = e*; while taking <f>(x ) = logx, and using the opposite 
inequality to (20) we get 

log(l+y) < 1+ y, y^0. 
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It would seem that functions <f> for which <f>" > 0 are useful; they are 
called convex functions , while if <f>" > 0 we call them strictly convex. 
Strictly speaking the class of convex functions can be defined without 
assuming second order differentiability, but that is not important here, 
[16]. Functions <j> for which <f>" < 0, (< 0) are called concave, (strictly 
concave), functions. The importance of these classes is further enhanced 
by the fact that all such functions they satisfy an important inequality, 
Jensen’s Inequality. Let us see that this is the case. We will only discuss 
the case of convex functions, the cases of strictly convex, concave and 
strictly concave functions are essentially similar. 

If (f>" > 0 then <j>' is increasing so that if x < z < y we must have the 
following inequality between chord slopes 

<K Z ) ~ ¥, x ) < <Kv) ~ <K Z ) / 21 ) 

z - x ~ y - z 

This is because, by the Mean Value Theorem of Differentiation the left 
hand side of (21) is 4 >'(u) for some u, x < u < z, while the right hand side 
of (21) is 4 >'{y ) for some v , z < v < y, and u < v means that <J>'(u) < 

Now consider the simple identity between various chord slopes: 

<Kv) - <K x ) _ { y - z \ ( 4>( z )- g/(*) \ + { ^ / Hy) - 

y-x \y — x J \ z-x ) \y-x)\ y-z )' 

Since the coefficients on the right hand side add to 1 and are positive, this 
exhibits the chord slope on left hand side as a weighted arithmetic mean 
of the two chord slopes on the right hand side. So either immediately, or 
as an application of (11), we see that from (21) we get that 


<K z ) - ^0) < <Kv) - <K*) < <Ky) ~ ( 22 ) 

z — x — y — x ~ y — z 

Since we chose x,y,z with x < y < z but otherwise arbitrarily it follows 
that (21) implies that the chord slope ( <f>(y ) — </>(x))/(y — x) increases 
with either x or with y. In particular in (22) we let z tend to x , and 
independently let 2 tend to y we deduce that <f>'(x) < (f>'(y), or that </>' is 
increasing, that is <f>'' > 0, or <f> is convex. So that (21) is equivalent to 
convexity. 

Rewriting (21) we get that 


<Kz) < ^( 23 ) 

y — x y — x 
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now note that 


and that if 


y - z z - x 

z = - x H- y 

y — x y — x 


A = -—— then, 0 < A < 1, and 1 — A = ——— 


y - x 


y ~ x 


so that (23) can be written; 
if 0 < A < 1 


or 


<£(( 1 ~ ^) x + *y) < (1 - X)<f>(x) + \<f>(y)] 


<t>[ A 2 (x,y, 1 



<K x ),4>{y )\ 1 



(J) 


This is Jensen’s inequality, and as we have seen is equivalent to convexity; 
it has a simple geometric interpretation: the points on the chord are above 
the points of the graph see [3], p.27. The discussion of the strictly convex 
case would show then we can only get equality in (J) if x = y. 

To see the power of (J) note the following two cases: 

(a): <j>{x ) = x f ,t > 1, is a strictly convex function; so by (J) in this 

case 

^4 2 (z, y; 1 - A, A)^ < A 2 (x t ,y t ] 1 - A, A), 
or, with a slight change of notation 

A 2 (a; Mi) < M\ 


the special case n = 2 of (19). 

(b): 4 >(x) = logx, a strictly concave function; so by the opposite 
inequality to (J) in this case 


log^4 2 (x,y;l - A,A)^ > A 2 ^logx,logy, 1 - A,A^; 
or, by (11), again with a slight change of notation 


A 2 (a;w) > G 2 (a;w), 


65 







the special case n = 2 of ( G-A g ). 

To be completely useful we need to generalize (J) : 
if 4> is convex then 


<f>(A n (a;w) > A n (<f>(a ) ; w) 




further if (f> is strictly convex there is equality only in the case d\ = • • • = 

CL n • 

The proof is a simple application of induction and the special case 
(J); consider ( J g ), 

right hand side =A n (<£(a); w) = -A n _i(<ft(a); w) + ^-</>(a n ), by (13) 

by the induction hypothesis 

^ n -i(a; m.) + by (J) 

=(j){A n (a] w)) = left hand side. 

Now using ( J g ) we can use the arguments in (a) to complete the proof 
of (19), and of (b) to give another proof of (G-A 3 ). 

This is but an introduction to the many applications of convex func¬ 
tions and of Jensen’s inequality; see [3], [4], [8], [10],[11], [14], [16]. 


§7. Nanjundiah’s Inequalities 

It might be thought that no further generalizations of (B) are possible. 
However generalizations come in two types; those like (J 9 ) are extremely 
useful because of the many important special cases, those like the inequal¬ 
ities of Rado and Popoviciu are important because they give more insight 
into an important inequality. On of the most remarkable generalizations 
of the second kind is due to Nanjundiah,[13], remarkable and neglected. 
This is in part due to no proof having been published, the proof in [3], 
p.121, being wrong, as was pointed out in a private communication by 
Alzer. Professor Nanjundiah has told the author that he hopes to publish 
his proofs in the near future. 
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What are these remarkable results. For simplicity we will only con¬ 
sider the case of equal weights. Given a sequence of positive numbers, a 
the rth power means of the first n terms of this sequence define a new 
sequence, 

M w = (Mj r| ( a), .AfM(a),...); 

the cases r = 0,1 have already been used in section 4. 

In terms of these sequences the inequality (r;s) says that if —oo < 
r < s < oo then the sequence dominates the sequence M [r] , or 

M [r] < M [s] . 

Suppose now we take the smaller rth power mean of the larger sth 
sequence, how does it compare with the larger sth power mean of the 
smaller rth sequence; or are the two comparable? The remarkable result 
of Nanjundiah is 

AfW (M [s] ) > M [ n s] (M [r] ). ( N ) 

To get some idea of just how remarkable a result (N) is we consider first 
the special case n = 2,0<r<s and put ai = x,ai = y, when (N) 
becomes 


( : 


+ ( 


X * + V\r/s 


\l/ r fX k 

) H- 


+ 


l/s 


(24) 


We simplify this along the lines of our discussion of (r;s); put a = x r , b = 
y r ,t = s/r when of course t > 1 and 


(24) 



now putting z = b/a we reduce (21) to the equivalent 


1 

2 




i/< 


It turns out that this can be reduced to (J) ; put T(z ) = (1 + z*)/2, and 

1 + Z 1 /* 

S(z ) = ---, then the last inequality can be written as 


S o T(l) + S o T(z) 

2 


>s.r(l±£), 


(25) 


67 
















where S o T(z ) = S(T(z )) = (1/2)(1 + (1 + z t )/2) 1 / t . Simple calculations 
show that S o T" > 0, so that S o T is convex and (25) is jus a case of (J). 
In a similar way ifn = 2, r = 0<s then, with ax = x, <*2 = y, (N) is 


just 



x + y/xy 
2 


or, putting z = y/x 



1 + \fz 


which is immediate, or can be considered a case of the inequality (1;2). 

Similar proofs can be given in the n = 3 case, but for the general case 
a new idea along the lines suggested in [13], is needed. 

Finally let us see how (N) can imply other well known inequalities. 
The case of r = 0, s = 1 is 


G n (A) > A n (G), 


or 


„//<*i + 02 \ ,cq + • • • + a n \ ^ °i + v/ a i a 2 + • • • + \Za\02 • • • a n 

V ( 2 ^'' • <-n-) - - n -• 

This can be rewritten as 

n r—r \J d\ (^i + C12 ) * * * (d\ + ‘ + Q>n) > <*1 + y/d\CL2 + • * • + \Jd\0,2 * * • d n . 

vn! v 

(26) 

By (G-A) 

\/&i(ai + <22) • • • (&1 + • • • + dn) < ai + • • • + by (G-A), 


so using (14) inequality (26) implies the weaker 


a \ + \fd\d2 + • • • + \/d\a2 ■ • • d n < e(ai + • • • + a n ), 


or, in series form 

oo oo 

^ ^ Cr n (ci) < e ^ ^ a n , 

n=l n=l 

a result known as Carleman’s Inequality. 
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